Subhash; Zegura, Ellen W. 

Corporate Source: College of Computing Georgia Institute of Technology, 
Atlanta, GA 30332, United States 

Source: IEEE Journal on Selected Areas in Communications v 20 n 7 
September 2002. p 1328-1344 

Publication Year: 2002 

CODEN: ISACEM ISSN: 0733-8716 

Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 0210W1 

Abstract: In the current Internet, Web content is increasingly being 
cached closer to the end user to reduce network and Web server load and 
improve performance. Existing Web caching systems typically cache entire 
Web documents and attempt to keep them consistent with the origin server. 
This approach works well for text and images; for bandwidth-intensive 
multimedia data such as audio and video, caching entire documents is not 
cost effective and does not scale. An alternative approach is to cache 
parts of the multimedia stream on different caches in the network and 
coordinate stream playback from these independent caches. From the 
perspective of the clients, the collection of cooperating distributed 
caches acts as a single fault tolerant, scalable cache. In this paper, we 
focus on data placement and replacement techniques for such cooperating 
distributed caches. Specifically, we propose the following new schemes 
that work together. 1) A family of distributed layouts, consisting of 
two layouts, namely RCache and Silo. The RCache layout is a simple, 
randomized, easy-to-implement layout that distributes constant length 
segments of a clip among caches and provides modest storage efficiency. 
The Silo scheme improves upon RCache; it accounts for long term clip 
popularity and intraclip segment popularity metrics and provides 
parameters to tune storage efficiency, server load, and playback 
switch-overs. 2) Rainbow, a local data replacement scheme based on the 
concept of segment access potential that accurately captures the 
popularity metrics. 3) Caching Token, a dynamic global data replacement or 
redistribution scheme that exploits existing data in distributed 
caches to minimize data distribution overhead. Our schemes optimize 
storage space, startup latency, server load, network bandwidth usage, 
and overhead from playback switch-overs. Our analytical and simulation 
results show that the Silo scheme provides three to eight times higher 
cache hit ratio than a comparable traditional Web caching system that has 
the same amount of storage space. 30 Refs. 

Descriptors: Cache memory; Fault tolerant computer systems; World Wide 
Web; Servers; Multimedia systems; Video on demand; Distributed 
computer systems; Bandwidth; Computer simulation 
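
The randomized layout idea in the abstract above can be illustrated with a minimal sketch. It is not the paper's RCache algorithm: the abstract gives no placement probabilities, so this version simply assigns each constant-length segment to one uniformly chosen cache. The function names, the cache identifiers, and the use of seconds as the length unit are all assumptions made for illustration.

```python
import random

def random_segment_layout(clip_seconds, segment_seconds, cache_ids, rng):
    """Map each fixed-length segment index to one randomly chosen cache.

    Hypothetical helper: one copy per segment, uniform choice of cache.
    """
    if segment_seconds <= 0 or not cache_ids:
        raise ValueError("need a positive segment length and at least one cache")
    # Ceiling division: a final partial segment still needs a home.
    num_segments = -(-clip_seconds // segment_seconds)
    return {index: rng.choice(cache_ids) for index in range(num_segments)}

def playback_plan(layout):
    """List the caches in playback order and count switch-overs between them."""
    order = [layout[index] for index in sorted(layout)]
    switches = sum(1 for a, b in zip(order, order[1:]) if a != b)
    return order, switches
```

Counting switch-overs in the playback plan shows why a layout that trades storage efficiency against switch-overs needs tunable parameters: shorter segments spread load more evenly but force the client to change caches more often.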
Abstract: Employing distributed multimedia applications (DMA) 
requires management support for multiple configuration steps including the 
definition of a desired DMA topology, the specification of a desired 
quality of service (QoS) and its enforcement through resource reservation. 
In this paper, we examine the additional aspect of finding an appropriate 
placement for a DMA within a distributed computer system (DCS). An 
overall approach is described for interrelating placement functions with 
existing procedures for topology and QoS specification and resource 
reservation. Then the problem of assigning a DMA within a DCS is formulated 
with the goal of finding a DMA placement with minimized computation and 
communication cost. For solving the assignment problem an efficient 
heuristic algorithm - SIGMA - is presented. Unlike other approaches, SIGMA 
takes into account requirements that are specific to multimedia 
applications. Based on experiments conducted for randomly generated DMA and 
DCS graphs, the efficiency and accuracy of SIGMA are shown to be encouraging 
because, at low execution times, it finds assignments with cost very close 
to the optimal one. (Author abstract) 39 Refs. 
Corporate Source: Foundation for Research and Technology-Hellas, Crete, 
Source: IEEE Transactions on Circuits and Systems for Video Technology v 
8 n 1 Feb 1998. p 13-17 
Publication Year: 1998 
Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Abstract: In distributed multimedia servers where client requests for 
different video streams may have different probabilities, placement of 
video streams is an important parameter because it may result in unbalanced 
requests to the system's stations, and thus to high blocking probabilities 
of requests. We present a method, MMPacking, to balance traffic load and 
storage use in a distributed server environment. Since different video 
streams are requested by clients with different rates, video stream 
replication is used to balance the traffic patterns of the stations; thus, 
the requests and I/O usage of the stations are balanced, since replication 
allows requests for the same video stream to be routed to different 
stations. MMPacking achieves load balancing by producing at most N minus 1 
replicas of video streams in a system with N servers. These replicas are 
distributed among the stations so that storage balancing is achieved as 
well, since no station stores more than two video streams more than any 
other station in the system. (Author abstract) 8 Refs. 
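
The replica bound quoted in the abstract above (at most N minus 1 replicas for N servers) can be illustrated with a simplified sequential fill. This is not MMPacking itself, and it does not reproduce its storage-balancing guarantee. It only shows why splitting a stream's request probability at server boundaries needs at most N - 1 extra copies. The function name and input format are assumptions.

```python
from fractions import Fraction

def split_fill(probabilities, num_servers):
    """Give every server an equal share of request probability.

    probabilities: dict mapping stream name -> request probability.
    Returns one dict per server mapping stream name -> probability served there.
    A stream is replicated only when it straddles a server boundary.
    """
    if num_servers < 1:
        raise ValueError("need at least one server")
    shares = {name: Fraction(p) for name, p in probabilities.items()}
    capacity = sum(shares.values(), Fraction(0)) / num_servers
    placement = [dict() for _ in range(num_servers)]
    server, room = 0, capacity
    for name, remaining in shares.items():
        while remaining > 0:
            last = server == num_servers - 1
            # The last server absorbs whatever is left.
            share = remaining if last else min(remaining, room)
            placement[server][name] = share
            remaining -= share
            room -= share
            if room <= 0 and not last:
                server, room = server + 1, capacity
    return placement
```

Because a stream is split only when a server fills up, and only the N - 1 internal boundaries can cause a split, the number of extra copies never exceeds N - 1. Exact fractions are used so that the per-server loads compare equal without rounding error.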

Descriptors: Digital communication systems; Algorithms; 
Distributed computer systems 
This dissertation presents and evaluates several new algorithms that 
improve the performance of distributed file systems by migrating or 
copying files as necessary between system nodes. The results are based on 
analysis of file reference patterns from three commercial installations, 
modeling, and trace driven simulation. 

The first part of the dissertation is an exploratory analysis of how 
shared user files are referenced. We find that although few files are 
shared, they are opened frequently, and account for a large fraction of the 
I/O traffic to user files. The reference pattern to shared files is not 
easily characterized, and varies widely among files. A batch Poisson 
process with geometric batch size is determined to be the most appropriate 
model. 

Based on the exploratory analysis, we developed several algorithms for 
file migration and replication. The algorithms evaluated include those 
based on our file reference pattern analysis, as well as simple strategies 
such as static placement and movement on reference, and optimal 
look-ahead migration and placement. We found that only a few files should 
be migrated or replicated, but replication or migration can substantially 
reduce the network traffic (up to 63% for replication and 36% for 
migration, relative to static placement). 

A policy based on a batch Poisson process with geometric batch size 
has the best performance when replication is not allowed. It uses as 
decision variables the fraction of a file accessed per open, the number of 
references from a user, and the number of changes in locality. 

By replicating files, the network traffic can be reduced further 
compared to migration alone (up to 42%). Whether the additional copies 
should be invalidated or updated when the file is updated depends on the 
installation and the rules for placing users at nodes. The algorithms with 
the best performance use the average reference rate, the number of 
consecutive opens in update mode, and the time since the node started using 
the file as the decision variables. By comparing our realizable algorithms 
with optimal unrealizable algorithms, we show that it is unlikely that 
other migration or replication algorithms can achieve a substantially 
better performance. 
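
The reference model named in the dissertation abstract above, a batch Poisson process with geometric batch size, can be sketched as a small generator of synthetic file-open times. Treating each arrival as a batch of simultaneous opens, and the parameter names rate and p_stop, are assumptions for illustration. The abstract does not give the fitted parameter values.

```python
import random

def batch_poisson_opens(rate, p_stop, horizon, rng):
    """Generate open times from a batch Poisson process.

    Batches arrive as a Poisson process with the given rate (exponential gaps).
    Each batch size is geometric on 1, 2, 3, ... with mean 1 / p_stop.
    Returns a sorted list with one timestamp per open, all below horizon.
    """
    if rate <= 0 or not 0 < p_stop <= 1:
        raise ValueError("need rate > 0 and 0 < p_stop <= 1")
    times = []
    t = rng.expovariate(rate)
    while t < horizon:
        batch = 1
        # Keep extending the batch with probability 1 - p_stop.
        while rng.random() > p_stop:
            batch += 1
        times.extend([t] * batch)
        t += rng.expovariate(rate)
    return times
```

Setting p_stop to 1 reduces the generator to an ordinary Poisson process, which gives a convenient baseline when comparing migration policies on bursty versus non-bursty synthetic traces.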
Conference Sponsor: IEEE Comput. Soc. Tech. Committee on Distributed 
Process 

Conference Date: 31 May-4 June 1999 Conference Location: Austin, TX, 
USA 

Language: English Document Type: Conference Paper (PA) 
Abstract: The paper proposes a protocol suite for dynamic replication and 
migration of Internet objects. It consists of an algorithm for deciding 
on the number and location of object replicas and an algorithm for 
distributing requests among currently available replicas. Our approach 
attempts to place replicas in the vicinity of a majority of requests, 
while ensuring at the same time that no servers are overloaded. The request 
distribution algorithm uses the same simple mechanism to take into 
account both server proximity and load, without actually knowing the 
latter. The replica placement algorithm executes autonomously on each 
node, without the knowledge of other object replicas in the system. The 
proposed algorithms rely on the information available in databases 
maintained by Internet routers. A simulation study using synthetic 
workloads and the network backbone of UUNET, one of the largest Internet 
service providers, shows that the proposed protocol is effective in 
eliminating hot spots and achieves a significant reduction in backbone 
traffic and server response time at the expense of creating only a small 
number of extra replicas. (35 Refs) 
Subfile: B C 
Abstract: The large size, hard real-time constraints, and general 
complexity of a video-on-demand system lead to many difficult design 
decisions that may affect the system's ultimate cost and performance 
substantially. The primary design problems of the video-on-demand system 
are real-time disk scheduling, buffer management, prefetching, movie 
allocation, disk striping, device bandwidth reduction and others, where the 
movie allocation determines the number of movie copies and the placement 
of movie copies in video servers. Among these problems, movie allocation 
is important because it governs how efficiently space and 
bandwidth on video servers are used. In this paper, we study movie 
allocation for distributed video-on-demand systems, and propose a movie 
replication algorithm considering space and movie popularity, and a movie 
allocation algorithm which has a new movie selection policy, grouped 
alternation, and a new movie placement policy, SCAN. We design two movie 
allocation algorithms: grouped alternate round-robin and grouped alternate 
SCAN, where the proposed algorithms allocate a group of movies instead of 
one movie to video servers at each round. The performance of the proposed 
algorithms is evaluated through simulations. Simulation results show that 
the performance of the proposed algorithms is better than that of other 
algorithms in terms of load balancing among video servers' bandwidths and 
blocking probability for a movie request. (12 Refs) 

Subfile: B C 
Abstract: We define and formulate various policies for load management in 
distributed video servers. We propose a predictive placement policy 
that determines the degree of replication necessary for popular videos 
using a cost based optimization procedure based on a priori predictions of 
expected subscriber requests. For scheduling requests, we propose an 
adaptive scheduling policy that compares the relative utilization of 
resources in a video server to determine an assignment of requests to 
replicas. To optimize storage utilization, we also devise methods for 
dereplication of videos based on changes in their popularities and in 
server usage patterns. Performance evaluations indicate that a load 
management procedure which uses a judicious combination of the different 
policies performs best for most server configurations. Advances in storage 
technologies are making high performance video servers a reality. These 
video servers are being deployed over emerging broadband networks to 
deliver a variety of interactive, digital video services to thousands of 
residential subscribers. To meet the scalability requirements in such large 
deployments, distributed video server architectures are being 
considered (M. Buddhikot and G. Parulkar, 1995). We propose various methods 
for load management that are targeted at improving the cost effectiveness 
of distributed video servers. (8 Refs) 
Descriptors: broadband networks; interactive video; network servers; 
resource allocation; video equipment 

Identifiers: load management; distributed video servers; predictive 
placement policy; replication; popular videos; cost based optimization 
procedure; a priori predictions; expected subscriber requests; scheduling 
requests; adaptive scheduling policy; relative resource utilization; 
broadband networks; digital video services; residential subscribers; 
scalability requirements; distributed video server architectures; cost 
effectiveness 

Class Codes: B6430H (Video recording); B6210L (Computer communications); 
C5620 (Computer networks and techniques); C5690 (Other data communication 
equipment and techniques); C6150J (Operating systems) 

Copyright 1997, IEE 



16/5/17 (Item 6 from file: 2) 

DIALOG(R) File 2: INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

4603370 INSPEC Abstract Number: B9404-6210L-014, C9404-5620-008 
Title: A case for caching file objects inside internetworks 

Author(s): Danzig, P.B.; Hall, R.S.; Schwartz, M.F. 

Author Affiliation: Dept. of Comput. Sci., Univ. of Southern California, 
Los Angeles, CA, USA 

Journal: Computer Communication Review vol.23, no. 4 p. 239-48 
Publication Date: Oct. 1993 Country of Publication: USA 
CODEN: CCRED2 ISSN: 0146-4833 

Conference Title: SIGCOMM '93. Communications Architectures, Protocols 
and Applications 

Conference Date: 13-17 Sept. 1993 Conference Location: San Francisco, 
CA, USA 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Experimental (X) 

Abstract: The paper presents evidence that several, judiciously placed 
file caches could reduce the volume of file transfer protocol (FTP) 
traffic by 42%, and hence the volume of all NSFNET backbone traffic by 
21%. In addition, if the FTP client and server software automatically 
compressed data, these savings could increase to 27%. The authors believe 
that a hierarchical architecture of whole file caches, modeled after the 
existing name server's caching architecture, could become a valuable part 
of any internet. They derived these conclusions by performing trace driven 
simulations of various file caching architectures, cache sizes, and 
replacement policies. They collected the traces of file transfer 
traffic employed in their simulations on a network that connects the 
NSFNET backbone to a large, regional network. This particular regional 
network is responsible for about 5 to 7% of NSFNET traffic. While the 
present paper's analysis and discussion focus on caching for FTP file 
transfer, the proposed caching architecture applies to caching objects 
from other internetwork services. (18 Refs) 
Subfile: B C 
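
The trace-driven evaluation described in the abstract above can be illustrated with a minimal whole-file cache replay. LRU is used here only as one example of a replacement policy. The abstract does not say which policies were simulated, and the trace format (file name, size in bytes) and function name are assumptions.

```python
from collections import OrderedDict

def replay_lru(trace, capacity_bytes):
    """Replay (file_name, size_bytes) transfers through a whole-file LRU cache.

    Returns the byte hit ratio: bytes served from cache / bytes requested.
    """
    cache = OrderedDict()  # file name -> size, least recently used first
    used = 0
    hit_bytes = 0
    total_bytes = 0
    for name, size in trace:
        total_bytes += size
        if name in cache:
            hit_bytes += size
            cache.move_to_end(name)  # mark as most recently used
            continue
        if size > capacity_bytes:
            continue  # too large to cache at all; counts as a miss
        while used + size > capacity_bytes:
            _, evicted_size = cache.popitem(last=False)
            used -= evicted_size
        cache[name] = size
        used += size
    return hit_bytes / total_bytes if total_bytes else 0.0
```

For example, replaying the five transfers a, b, a, c, b (4 bytes each) through an 8-byte cache yields a byte hit ratio of 0.2, while a 12-byte cache yields 0.4. Sweeping the capacity this way shows how cache size drives the fraction of transfer traffic that can be avoided.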
Make Web caching pay off - You know Web caching saves time and 
resources. But what's the best caching solution for your enterprise? 
ISSN: 1046-4468 
Document Type: Articles, News & Columns 
Discusses Web caching solutions. Describes how Web caching works. 
Reports that worldwide, caching has caught on, noting that estimates of 
cacheable content online are now near 80 percent. Says that typical 
caching configurations are the familiar cache-enabled proxy and the 
transparent cache. Explains that cache-enabled proxy is similar to a proxy 
server, except that Web content is stored on a random-access-memory (RAM) 
store or disk. Explains that most proxy caches also cache File 
Transfer Protocol (FTP), Gopher and Network News Transfer Protocol 
(NNTP) traffic. States that transparent caches intercept all outbound 
network traffic, including Hypertext Transfer Protocol (HTTP). Mentions 
three standard ways to place these caches inline, namely, as an Ethernet 
bridge, as the network's configured default gateway, and by using Web 
Caching Control Protocol (WCCP) developed by Cisco Systems. Includes one 
sidebar and one diagram. (MEM) 
The management of global (shared) files can have a big impact on the 
performance of distributed computer systems. In this dissertation effort is 
devoted to the study of the file referencing process and to the development 
of algorithms to place and migrate files in distributed computer 
systems. The topic of assigning files to the nodes of computer networks has 
received considerable attention in the past. Unfortunately, most of the 
work in this area has concentrated on the optimization of distributed 
file systems based on extremely simple models of the workload. This study 
is based on traces of activity collected from a number of real systems. The 
workload could be analyzed and algorithms adapted to it. Synthetic 
distributed systems were obtained from the real centralized systems that 
generated the traces. The SLAC and the Hughes systems were used. This 
partitioning of the users of a large centralized computer system into a 
number of smaller user communities was based on their utilization of shared 
files. The SLAC system was found to produce user subsets with a higher 
degree of overlapping (in terms of shared files) than the Hughes Aircraft 
installation. The file migration policies are introduced. First the single 
copy policies are presented. In this context a migration policy was 
developed that is optimal in the sense of minimizing the average network 
traffic. The existence of extended localities in the use of shared files 
suggested the use of multiple-copy policies in order to reduce the network 
traffic even further. A number of policies based on the Working Set model 
were studied. The best performing policies in terms of generated traffic 
are those that closely monitor the update traffic and that react to it by 
deleting the copies that are too expensive to maintain up to date. (ERA 
citation 07:060691) 
Mathematical models 

Identifiers: ERDA/990200; NTISDE 
Abstract: We present an architecture of a VOD system employing proxy 
servers. The proposed VOD system provides efficient and reliable VOD 
services and solves the problems caused by traditional VOD systems of 
centralized, hierarchical or distributed architecture. The proxy 
servers are placed between video servers and user systems. The proxy 
server is a small size video server that has not only caching function 
but also intelligence such as VCR-like video stream control or 
navigation of other proxy/video servers to search for a selected video 
program. Using a VOD system of the proposed architecture, the VOD 
services can be provided to more users because it reduces the workload 
of video servers and network traffic. We provide the 
performance model of the system. Service availability is also analyzed. 
The proposed architecture shows better performance and availability 
than the traditional VOD architectures. 
ABSTRACT 

PROBLEM TO BE SOLVED: To realize communication of a server/client type 
application program with more excellent response performance that can cope 
with increase in a communication delay and reduction in a communication 
transfer speed due to factors such as use of a communication medium with 
a large delay and at a slow transfer rate, increase in number of 
intermediate communication passing places or increase or the like in a 
physical distance of a communication path. 

SOLUTION: A communication characteristic between a server program 203 on 
a server machine and a client machine 206 is measured, and when the 
communication characteristic does not satisfy a prescribed communication 
characteristic criterion, at least part of the server program 203 is 
copied to a server object machine close to the client machine 206 and 
the client program on the client machine 206 is connected to the copied 
server program 202. 
NOVELTY - The identification information of relay devices between 
each observed candidate position for installing the server and the 
user terminal position is acquired, based on which the shortest 
distance between candidate and user terminal positions is identified. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following: 

(1) Position selection system; and 

(2) Recorded medium storing position selection program. 

USE - For providing information delivery service by HTTP using 
several web servers, especially when the service provided includes 
voluminous information to be transferred, like file delivery, 
video streaming, etc., in internet. 

ADVANTAGE - Frequency of packet transfer in position selection 
process is reduced, thus reducing estimation process duration and 
network load. Presumption of more suitable vicinity position is 
possible, with reduced estimation error. The size of the path 
information table and server information table are small, which 
reduces required memory resource. 

DESCRIPTION OF DRAWING(S) - The figure shows the basic components 
of the position selection device. (Drawing includes non-English 
language text). 
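
The selection step summarized in the record above can be sketched as follows, assuming that distance is measured as the number of relay devices on the path between a candidate position and the user terminal. That distance measure, the function name, and the input format are assumptions. The abstract does not define how distance is computed.

```python
def nearest_candidate(relay_paths):
    """Pick the candidate server position with the fewest relay devices.

    relay_paths: dict mapping candidate position -> list of relay device
    identifiers observed between that position and the user terminal.
    Ties are broken by candidate name so the result is deterministic.
    """
    if not relay_paths:
        raise ValueError("no candidate positions given")
    return min(relay_paths, key=lambda pos: (len(relay_paths[pos]), pos))
```

Given paths of three, one, and two relays for candidates site-a, site-b, and site-c, the function returns site-b. Only the relay identifiers are needed, which is consistent with the record's claim that the path information table stays small.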